The PROOF Distributed Parallel Analysis Framework based on ROOT
نویسندگان
چکیده
The development of the Parallel ROOT Facility, PROOF, enables a physicist to analyze and understand much larger data sets on a shorter time scale. It makes use of the inherent parallelism in event data and implements an architecture that optimizes I/O and CPU utilization in heterogeneous clusters with distributed storage. The system provides transparent and interactive access to gigabytes today. Being part of the ROOT framework PROOF inherits the benefits of a performant object storage system and a wealth of statistical and visualization tools. This paper describes the key principles of the PROOF architecture and the implementation of the system. We will illustrate its features using a simple example and present measurements of the scalability of the system. Finally we will discuss how PROOF can be interfaced and make use of the different Grid solutions.
منابع مشابه
PROOF as a Service on the Cloud: a Virtual Analysis Facility based on the CernVM ecosystem
PROOF, the Parallel ROOT Facility, is a ROOT-based framework which enables interactive parallelism for event-based tasks on a cluster of computing nodes. Although PROOF can be used simply from within a ROOT session with no additional requirements, deploying and configuring a PROOF cluster used to be not as straightforward. Recently great efforts have been spent to make the provisioning of gener...
متن کاملStudy of Solid State Drives performance in PROOF distributed analysis system
Solid State Drives (SSD) is a promising storage technology for High Energy Physics parallel analysis farms. Its combination of low random access time and relatively high read speed is very well suited for situations where multiple jobs concurrently access data located on the same drive. It also has lower energy consumption and higher vibration tolerance than Hard Disk Drive (HDD) which makes it...
متن کاملPROOF on Demand
The Parallel ROOT [1] Facility, PROOF [2], is an extension of ROOT enabling interactive analysis of large sets of ROOT files in parallel. PROOF on Demand, PoD [3], is a set of utilities, which allows starting a PROOF cluster at user request on any resource management system. Installation is simple and doesn’t require administrator privileges, and all the processes run in user space. PoD gives u...
متن کاملAN OPTIMAL FUZZY SLIDING MODE CONTROLLER DESIGN BASED ON PARTICLE SWARM OPTIMIZATION AND USING SCALAR SIGN FUNCTION
This paper addresses the problems caused by an inappropriate selection of sliding surface parameters in fuzzy sliding mode controllers via an optimization approach. In particular, the proposed method employs the parallel distributed compensator scheme to design the state feedback based control law. The controller gains are determined in offline mode via a linear quadratic regular. The particle ...
متن کاملOptimizing Neural Network Classifiers with ROOT on a Rocks Linux Cluster
We present a study to optimize multi-layer perceptron (MLP) classification power with a Rocks Linux cluster [1]. Simulated data from a future high energy physics experiment at the Large Hadron Collider (LHC) is used to teach a neural network to separate the Higgs particle signal from a dominant background [2]. The MLP classifiers have been implemented using the ROOT data analysis framework [3]....
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2003